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Abstract 

The influence of systematic errors on the calculation of the statistical significance 
of a 7-ray signal with the frequently invoked Li and Ma method is investigated. 
A simple criterion is derived to decide whether the Li and Ma method can be 
applied in the presence of systematic errors. An alternative method is discussed 
for cases where systematic errors are too large for the application of the original 
Li and Ma method. This alternative method reduces to the Li and Ma method 
when systematic errors are negligible. Finally, it is shown that the consideration 
of systematic errors will be important in many analyses of data from the planned 
Cherenkov Telescope Array. 

Keywords: Gamma-Ray Astronomy, Systematic Errors, Statistical 
Significance 


1. Introduction 

The calculation of the statistical significance of a measured signal is a cen¬ 
tral part of a data analysis in Very High Energy (VHE) 7-ray astronomy. In a 
typical analysis, the number of events, Nq n, that are detected in a signal region 
is compared to the number of events, o-ZVoffi that are expected in the signal 
region. The expected number of events in the signal region results from a mea¬ 
surement of Nqff events in a dedicated background region. Different methods 
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(see Q) exist to construct the background region for a given signal region such 
that the event acceptances in the signal and the background region are equal. 
If the event acceptances in the signal and the background region are equal, the 
normalization factor a is equal to the ratio of the exposures in the signal and 
the background region. 

The statistical significance of a 7-ray signal, A = Aon — a Aoff, is derived in 
[2] to be 

«5 , LiMa(-^ON, Aoff, a) = sign(A)y ■-2 ^Aon ln( ^° N ) + Aoff ln( 

( 1 ) 

where Aon = ckAoff and 

AW = AW(a) = Non + N °? f (2) 

1 + a 


The normalization factor a is taken to have a negligible error, a a , in the deriva¬ 
tion of Eq. |T| in Q. 

The order of magnitude of the statistical error on the 7-ray signal due to 
Poisson fluctuations of Aon and Aoff can be estimated to be ABackground ~ 
\/Aon + ck 2 Aoff- When no signal is present, it holds Aon ~ a Aoff which 
leads to ABackground ~ \Jct(a + 1) Aoff- A similar back of the envelope es¬ 
timation for the error on the 7-ray signal, propagated from an error on the 
normalization factor, leads to A„ ss u a AoFF- 

The application of AidMa t° calculate the statistical significance of a 7-ray event 
signal is justified if A a <C ABackground- Using the back of the envelope estima¬ 
tion for the case where no signal events are measured in the signal region, the 
condition A a <C ABackground translates into the condition 


O' ot 

— < 

a 


1 + a 1 


( 3 ) 


a Aoff 

for the relative error on the normalization factor. 

The following discussion focuses on data acquired with imaging atmospheric 
Cherenkov telescopes (IACTs). However, .SuMa is also frequently applied in 
analyses of data from ground based water Cherenkov telescopes (see e.g. 
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and |4() and extensive air shower arrays (see e.g. 

The High Energy Stereoscopic System (H.E.S.S.) is an array of IACTs that op¬ 
erates in the Namibian Khomas Highland since 2003. Observations of the Crab 
nebula with H.E.S.S. result in the detection of approximately 100 background 
events that pass standard Hillas 7 -ray event selection criteria per 30 min obser¬ 


vation time for a normalization factor of a = 0.2 (see [ 6 | for details). Similar 
background event rates hold for analyses of data from observations of point like 


B 


7 -ray sources with other current generation IACT arrays such as MAGIC 
and VERITAS B- According to Eq. [3l the relative error on the normalization 
factor must be much smaller than cr a /a ~ 25% when SbiMa is used to calculate 
the significance of a 7 -ray signal with Aoff ~ 100 and a = 0.2. More precisely, 
it will be shown later that in this case cr a /cx must be known to about 3% to 
justify the application of SbiMa- 


B). 


For the planned Cherenkov Telescope Array (CTA, see [9fl), the increase in the 
number of telescopes will (compared to current generation IACTs) lead to a 
factor of 10 larger effective area. An increase of the background event rate 
for CTA of a similar factor of 10 (again compared to current generation IACT 


arrays) is expected from the enlarged effective area (see [n|). Consequently, 
approximately 10 3 background events per 30 min observation time are expected 
in a typical point source analysis of CTA data with a = 0.2. In this case, it is 
estimated with Eq. [3] that the relative error on the normalization factor must be 
much smaller than 8 %. Otherwise, an error on the normalization factor must be 
considered. Again more precisely, it will be shown later that the relative error 
on the normalization factor must be known to about one order of magnitude 
better than 8% for the application of ShiMa in this situation. 

In analyses of extended 7 -ray sources (e.g. Supernova Remnants, Q) or galac¬ 
tic dark matter searches (e.g. 12 |), the increased size of the signal region leads 
easily to background event rates which are an order of magnitude larger than for 
point source analyses. It is obvious that in those cases, the normalization factor 
a must be known with an even better precision than in point source analyses. 
It is arguable whether the normalization factor is in general known with the 
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precision that is required for the application of iSbiMa- This holds in particular 
for analysis of data from the planned CTA experiment. 

This paper extends the method for the calculation of the statistical significance 
of a 7 -ray signal first proposed in Q to include an error on the normalization 
factor. In addition to other authors, who discussed the same problem (e.g. [till] . 
[hi| ), a simple expression for the calculation of the significance of a measured 7 - 
ray signal is derived. Moreover, the criterion given by Eq. [3] to decide whether a 
given error on the normalization factor must be considered in the calculation of 
the statistical significance of a 7 -ray signal is tested in Monte Carlo simulations. 
The structure of the paper is as follows: The effect of the neglection of a sys¬ 
tematic error on the normalization factor on SbiMa is quantitatively discussed 
in section [2] A modified expression for the significance calculation, which con¬ 
siders an error on the normalization factor, is derived in section [3] 

Section Q] discusses more specifically the influence of systematic errors on the 
calculation of statistical significances in VHE 7 -ray astronomy with IACTs like 
H.E.S.S. or CTA. 


2. The Li and Ma Significance with a Gaussian Distributed Normal¬ 
ization Factor 

Figure [I] shows the distribution of the significance as calculated with SbiMa 
when Aon = Pois(a 6 ) and IVoff = Pois(&) are independently Poisson dis¬ 
tributed with b = 500 events. No signal events are simulated and it is expected 
that the Gaussian fit of the distribution of the significances results in a distri¬ 
bution which is compatible with being a standard normal. The distribution of 
the significance is shown in Fig. |T] for a fixed normalization factor, a = 0 . 2 . 
Additionally shown is the distribution of SbiMa for two Gaussian distributed 
normalization factors. The mean of the assumed distributions of the normal¬ 
ization factor is a = 0.2 in both cases. Respectively, the relative widths of 
the Gaussian distributions for the normalization factor are a a /a = 8% and 
a a /a = 15%. The assumption of a Gaussian distributed normalization factor is 
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Figure 1: Distribution of SLiMa (Eq. [TJ when TVon = Pois(a6) and ATqff — Pois(6) are 
independently Poisson distributed with b = 500 events. The normalization factor, a , is 
distributed like a Gaussian with mean 0.2 and relative width according to the legend. The 
width of the Gaussian fit to the significance distribution is compatible with being one if the 
error on the normalization factor is vanishing. However, the width of the Gaussian fit to the 
significance distribution increases with the relative error on the normalization factor. 
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Figure 2: Fitted Gaussian width of the distributions of the significance as calculated with 
■SLiMa (Eq. [T| red points) and ^Modified (Eq. 1121 black points) as a function of S (see Eq. [4l l. 
The parameters TVqfFi a and a a /a are randomly distributed within JVqff £ [100,10000], 
a 6 [0.1, 2] and a a /a € [0,15%]. 


here and in the following restricted to small relative errors on the normalization 
factor (a a /a < 15%). For large relative errors on the normalization factor, the 
distribution cannot in general be assumed to be Gaussian, e.g. because the 
normalization factor must be larger than zero for physical reasons. However, 
the assumption of small relative errors on the normalization factor is reasonable 
because large relative errors can easily be identified and corrected for in analy¬ 
ses. 

The mean of all fitted significance distributions that are shown in Fig. [I] is 
compatible with zero. However, the width of the Gaussian fit to the significance 
distribution is only compatible with being one when the error on the normal¬ 
ization factor vanishes. For non-zero relative errors on the normalization factor, 
the width of the Gaussian fit to the significance distribution increases with the 
relative error on the normalization factor. In other words, the absolute value of 
the significance is overestimated by ^LiMa when the relative error on the nor- 
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malization factor is not vanishing. 

The examples in Fig. |T] hold only for special values for the mean number of 
background events and the normalization factor. In general, the criterion for 
the applicability of SbiMa given in Eq. [3] translates into the condition 


Nqff 


< 1 . 


( 4 ) 


y a(a + 1) 

Figure [2] shows the width of the Gaussian fit to the distribution of the signif¬ 
icance as a function of S. Here, the parameter 5 is calculated for a random 
selection of a , er Q and Noff- The simulated number of background events, 
IVoff, is Poisson distributed with uniformly distributed means in between 100 
and 10000. The normalization factor, a, is Gaussian distributed with uniformly 
distributed means in between 0.1 and 2 and with uniformly distributed relative 
widths in between cr a /a = 0 and a a /a = 15%. Figure [2] shows that the width 
of the Gaussian fit to the distribution of SbiMa is indeed larger than one if Eq. 
[Jdoes not hold. More specifically, Fig. [2] shows that the width of the Gaussian 
fit to the distribution of SbiMa becomes larger than one if E > 0.1. 

'S'LiMa is, under the assumption of a vanishing error on the normalization factor, 


constructed in [ 2 ] to be standard normal distributed when the null hypothesis 
is true, i.e. when no signal events are present. In turn, .ShiMa is not applica¬ 
ble when the systematic error on the normalization factor causes a distribution 
of 5LiMa which is not compatible with being a standard normal distribution 
although the null hypothesis is true. This means that SbiMa is applicable if 


S <0.1 


( 5 ) 


is fulfilled. 

Equation [5] translates into the condition <j a /a < 0.1 ^/{a + 1)/(a ./Voff) for the 
relative error on the normalization. This condition is more precise than Eq. 
[3] Also, this condition was already used in the introduction when the maximal 
relative error on the normalization factor that allows the application of SbiMa in 
typical point source analyses with current IACTs and the planned CTA detector 
was estimated. 
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For cases where a systematic error on the normalization factor cannot be ne¬ 
glected, a modified expression for the calculation of the statistical significance 
of a 7 -ray signal is derived in the next section. 


3. Modified Significance for a Gaussian Distributed Normalization 
Factor 

Consider, similar to the approach in Q, the likelihood function 


L(Non, Noff, a|s, b, a*) 

= Pois(iVoN, ot*b + s ) Pois(iVoFF, b) Gaus(a*, a, a a ) . 


( 6 ) 


Again, this likelihood function is in this work assumed to describe the case of 
small relative errors on the normalization factor, i.e. a a /a < 15%, to avoid e.g. 
negative a *. The general likelihood function given by Eq. [G] has already been 
investigated in the literature (see Q, lsj], jl4|). However, no simple result for 
the statistical significance of a measured signal under the assumption of Eq. [ 6 ] 
was derived before. Also, no simple criterion to decide whether Eq. [S] should be 
applied was stated before. This is done in the following. 

The likelihood-ratio test statistic is used with the likelihood function given by 
Eq. [ 6 ] to compare the null hypothesis, i.e. that no signal is measured (s = 0), 
with the alternative hypothesis (s / 0). The likelihood for the null hypothesis 
is 

L 0 = Pois(NoN, a*b) Pois(iVoFF> b) Gaus(a*, a, a a ) (7) 


and the two nuisance parameters b (i.e. the mean number of background events) 
and a* (i.e. the Gaussian distributed normalization factor with mean a) are 
eliminated with the profile likelihood method (see e.g. jlfilp. The profile likeli¬ 
hood condition dLo/db = 0 leads to b = Noff(&*) where 7Voff(o:*) is defined 
in Eq. [2] The second profile likelihood condition, dLo/dct* = 0, leads to the 
cubic expression 


A(a*) = a * 3 - a* 2 (a - 1 ) - a*(a - a 2 a N QFF ) - a 2 a N ON = 0 . ( 8 ) 





Equation [5] can have up to three real solutions. Since, for physical reasons, 
it^IVon > 0, it follows that A(0) < 0. Together with lim^.^oo A(a*) = oo, it 
follows that there is always at least one real and positive solution to Eq. [ 8 ] For 
example, if o a = 0, Eq. [5] has three solutions at a* = — 1 , a* = 0 and a* = a. 
Since Lq is not defined for a* = — 1 and Lq = 0 for a* = 0, the solution a* = a 
maximizes the likelihood for the null hypothesis. 

When (a + l ) 2 > 3<r 2 JVoff, the cubic equation A(a*) = 0 has in general a local 
maximum at a~ and a local minimum at a + where 

a ± = ^ (a — 1 ± \/(a + l) 2 - 3 o- 2 7V 0 ff) • (9) 

If additionally sign(A(a + )A(a - )) ^ 1, there are multiple real solutions to Eq. 
[HI In this case, a numerical method can be employed to search for respectively 
one solution to Eq. [S] in the intervals (0,a“], {a~,a + ] and (a + ,oo). Out of 
the up to three found solutions, the one solution which maximizes the likelihood 
given by Eq. [7]is to be selected. However, for numerical reasons, it is in practice 
better to chose the solution to Eq. [5] which maximizes 


In (./Von! IVoff! L 0 + IVon + -/Voff) 

= Non M«*) + ( Non + N orr ) In ( "°”. + + " 0,r ) - 


( 10 ) 


If Eq. [5] does not have local extrema or sign(A(a + )A(a - )) = +1, there is only 
one real solution to Eq. [8]which can be found numerically in the interval (0, oo). 
The parameters s = Non — a* b, b = Noff and a* = a maximize the likelihood 
function (Eq. [6|) and lead to the likelihood 


Li = Pois(A^oN, -/Von) Pois(A^oFF, -ZVqff) ■ 


1 


(ID 


<7 ty 27T 

for the alternative hypothesis. The alternative hypothesis has one more param¬ 
eter (s) than the null hypothesis. Following Wilks’ theorem 17], it is assumed 
that the likelihood ratio test statistic TS = — 21n(L 0 /Ai) is approximately dis¬ 
tributed like a ^-distribution with one degree of freedom. The validity of this 
assumption is justified with Monte Carlo simulations. The evaluation of the 
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likelihood ratio test statistic TS = — 21n(Lo/Ai) gives the modified significance 


SModified(-/VoN, -ZVqff, a, & a) = sign(AoN — aNo FF )VTS 


= sign(iVoN - oiNo FF ) \ Sl iMa (Noish Aqff, a*) + 


/ 


LiMa 



2 


( 12 ) 


Note that ^Modified —> ^LiMa when a* —► a , i.e. when cr a —> 0. This means that 
^Modified reduces to ^LiMa when systematic errors are negligible. 

Equation fl2l is tested in a parameter range (IVoff G [100,10000], a G [0.1,2] 
and a a /a G [0,15%]) which is in practice relevant for analyses of data from 
IACTs. The result is shown in Fig. [2] The distribution of ^Modified is compat¬ 
ible with being standard normal if Aon = Pois(a6) and Aoff = Pois(6) are 
independently Poisson distributed with fixed b for all tested combinations of a, 
-/Voff and <j a . In particular, the width of the Gaussian fit to the distribution 
of ^Modified shown in Fig. [2] as a function of 5 is compatible with being one. 

It is concluded that ^Modified is standard normal distributed when the null hy¬ 
pothesis is true and systematic errors are too large for the application SbiMa, 
i.e. if S > 0.1. For negligible systematic errors, ^Modified reduces to SbiMa- The 
range of possible application for ^Modified is thus larger than for .SbiMa- 

4. Example: Influence of Systematic Errors in IACT Analyses 

As an illustration for the relevance of the consideration of systematic errors 
in the calculation of signal significances, consider the case of an IACT. Fre¬ 
quently (see e.g. [l|, Q, lj]), a background region for a given signal region is 
constructed in IACT data analyses under the assumption that the background 
event acceptance is rotationally symmetric around the observation position. In 
this case, the normalization factor a is the ratio of the exposures of the signal 
and background region. However, imbalances of the event acceptances between 
the signal and background region lead in general to a deviation between the 
normalization factor and the exposure ratio. Typical reasons for the imbal¬ 
ances between the event acceptances in the signal and the background region 
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Figure 3: Comparison of ShiMa and ^Modified as a function of the number of background 
events per observation run for cr a /a = 3% (blue), cr a /a = 6% (red) and a = 1. The number 
of signal events is chosen such that SLiMa = 5. 

can e.g. be atmospheric differences, bright stars, varying night sky brightness 
or electronic problems. An order of magnitude for the deviation of the event 
acceptance from rotational symmetry around the observation position in an ob¬ 
servation with H.E.S.S. is estimated in l| to be 3%. Figure |3] shows ^Modified for 
an error of 3% on the normalization factor as a function of the number of back¬ 
ground events per observation run when the number of signal events is chosen 
such that ^LiMa = 5. It is evident that in a typical H.E.S.S. point source analy¬ 
sis with approximately 100 background events per observation run (see jsl), the 
influence of expected deviations from rotational symmetry on the calculation of 
the signal significance is almost negligible. However, it is also obvious that the 
influence of systematic errors becomes larger when the number of background 
events per observation run increases. This situation can easily occur in analyses 
of data from extended signal regions where a spatially larger background region 
is used than in point source analyses. As discussed in the introduction, approx¬ 
imately 10 3 background events are expected in a typical point source analysis 
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of data acquired with the planned CTA experiment ([9|, 
clearly that this increase of the expected number of background events per ob¬ 
servation run can decrease the source detection potential of CTA when analyses 
are performed under conditions that are otherwise similar to e.g. H.E.S.S. data 
analyses. 

Typically, multiple (i = 1... K) observations of a signal region are combined in 
an IACT analysis such that the exposure ratio ai is the same for all observations, 
i.e. at = a. However, deviations between the exposure ratio and the normal¬ 
ization factor from observation run to observation run lead to a distribution of 
normalization factors with mean a. Again, the distribution of normalization 
factors is assumed to be Gaussian with width cr a . For the combined dataset, 
the resulting error on the mean normalization factor is a a /y/K. The total num¬ 
ber of signal (./Von = E» -/VoN.i) and the total number of background events 
(-/Voff = Ei -^OFF.i) are used together with the exposure ratio a as normaliza¬ 
tion factor to calculate the significance of the total 7 -ray event signal. Consider 
the mean number of background events, {-/Voff) = 1/A'E,; Noffa- The crite¬ 
rion given by Eq. [5] for the application of S^iMa to data from one observation 
becomes 


Q 


1(1). Figure [3] shows 


-'Combined — 


-/Voff 


= cr Q 


(-/Voff) 


<0.1 


(13) 


y/Kya(a + l) “ a y a(a + l) 
when data from multiple observations are combined. Equation [13] means that 
given a normalization factor and an error on the normalization factor, the mean 
number of background events per observation is relevant to decide whether 
SuiMa is applicable or not when multiple observations are combined. If Eq. [Hi] 
is not fulfilled, Eq. [12] can be used to calculate the statistical significance of the 
combined signal as S , Modified(-/VoN, ./Voff, a, <j a /y/K). 

The argumentation given above doesn’t cover an important exception that con¬ 
cerns the case where an analysis is set up such that the exposure ratios are not 
equal for data from differing observations. This situation can in general only be 
treated with a likelihood function that is binned in the exposure ratios (see 13[) 
or even binned in more parameters that characterize the observation conditions 
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(see 


Q)- 


The considerations in this section show that it is, in particular for the planned 
CTA experiment, important to estimate the typical systematic error on the nor¬ 
malization factor for an observation run in an IACT data analysis. Depending 
on the mean number of background events per observation run, the exposure 
ratio and the estimated error on the exposure ratio, the usage of ^Modified is 
preferred to the usage of SbiMa for the calculation of the statistical significance 
of a 7 -ray signal. 


5. Conclusion 

The influence of systematic fluctuations of the normalization factor a on the 
calculation of the statistical significance of a 7 -ray signal was investigated. The 
discussion focussed on small relative fluctuations (cr Q /a < 15%) of a around 
its expected value. It was shown that the Li and Ma method derived in jsj 
for the calculation of the statistical significance should only be applied to data 
acquired in one observation if Eq. [5] is fulfilled. Similarly, if data from multiple 
equations are combined, Eq. [I3]must be fulfilled or SbiMa should not be applied. 
The absolute value of the statistical significance of a 7 -ray signal is in general 
overestimated if Eq. [5] or, respectively, Eq. [13] is not fulfilled. An alternative 
method (Eq. fl2l) for the calculation of the statistical significance was derived. 
This method takes random fluctuations of the normalization factor around its 
expected value into account. The alternative is simple to apply and reduces to 
the Li and Ma method if the systematic error on the normalization factor is 
negligible. 
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